Detecting unstructured events through multi-source data analysis
نویسندگان
چکیده
There is increasing interest in using urban data analysis for different applications, particularly in relation to urban management, transport optimisation and fields related to cities becoming ‘smarter’. Data used in the analyses comes from different sources and involves various domains, from transportation engineering to urban planning. Three forms of data in particular have been of interest within the research community for a number of years. The first one includes data sourced from what are called ‘smart cards’ (for ticketing purposes): mainly used for calculating commuting flows and tracking users trajectories (Munizaga and Palma 2012; Hasan et al. 2012), it also has some applications for urban zoning and activity detection (Roth et al. 2010; Zhong et al. 2014). The second type concerns mobile phone data, generally aggregated and anonymised by providers, used mainly for detecting urban dynamics and activities because of the fine granularity, considering mobile phone as a proxy for each single user. Research focus has been similar from pioneering works till recent ones (Ratti et al. 2006; Candia et al. 2008; Calabrese et al. 2011; Hasan et al. 2012). Finally, the third one involves social media data, which use increased in latest years mainly because of the users’ growth and its availability through API. Spatial applications are particularly relevant when data attributes include location information, as seen in Twitter (Hawelka et al. 2014), or Foursquare (Cranshaw et al. 2012) data. In this paper we will discuss how a deeper understanding of urban dynamics can be yielded through the joint analysis of all three datasets. Specifically we seek to develop a platform for the identification of events, with a view to informing urban planning and management.
منابع مشابه
Unsupervised Spatial Event Detection in Targeted Domains with Applications to Civil Unrest Modeling
Twitter has become a popular data source as a surrogate for monitoring and detecting events. Targeted domains such as crime, election, and social unrest require the creation of algorithms capable of detecting events pertinent to these domains. Due to the unstructured language, short-length messages, dynamics, and heterogeneity typical of Twitter data streams, it is technically difficult and lab...
متن کاملA Lightweight Approach to Uncover Technical Information in Unstructured Data
Developer communication through email, chat, or issue report comments consists mostly of largely unstructured data, i.e., natural language text, mixed with technical information such as project-specific jargon, abbreviations, source code patches, stack traces and identifiers. These technical artifacts represent a valuable source of knowledge on the technical part of the system, with a wide rang...
متن کاملA Wavelet-Based System for Event Detection
Sensors are increasingly being used for continuous monitoring purposes, the process of which generates huge volumes of data that need to be mined for interesting events in realtime. The purpose of this research is to develop a method to identify these events, and to provide users with an architecture that will allow them to analyze events online and in real-time, to act upon them, and to archiv...
متن کاملDiagnostic tools for 3D unstructured oceanographic data
Most ocean models in current use are built upon structured meshes. It follows that most existing tools for extracting diagnostic quantities (volume and surface integrals, for example) from ocean model output are constructed using techniques and software tools which assume structured meshes. The greater complexity inherent in unstructured meshes (especially fully unstructured grids which are uns...
متن کاملBuilding Multi-Resolution Event-Enriched Maps From Social Data
This paper discusses the next generation of digital maps, by positing that maps in future will intelligently self-update themselves based on distinctive events extracted dynamically from social media streams or other crowd-sourced data. To realize this concept, the challenges include developing a scalable and efficient system to deal with a variety of unstructured data streams, applying NLP and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015